PLOS Genetics
Top medRxiv preprints most likely to be published in this journal, ranked by match strength.
Show abstract
Chromosome 5p15.33 harbors several independent association signals which demonstrate antagonistic pleiotropy across cancer types, with causal mechanisms largely unresolved. To identify functional variants and enhancer elements at this locus, we performed statistical fine-mapping followed by massively parallel reporter assays (MPRA) and proliferation based CRISPRi screens. This approach identified eight multi-cancer functional variants (MCFVs) across three GWAS signals. Targeting rs421629 (part o...
Show abstract
IntroductionGenome-wide association studies (GWAS) for kidney function have mainly focused on creatinine-based glomerular filtration rate (eGFRcrea), which is affected by variation in muscle mass. Moreover, the genetic basis of the sexual dimorphism of chronic kidney disease is underexplored. MethodsWe performed a GWA meta-analysis for creatinine clearance (CrCl), a muscle mass-independent kidney function phenotype, in 58,976 individuals of European descent from the Lifelines Cohort Study. Res...
Show abstract
Background: Quantitative genome wide association studies (GWAS) primarily rely on additive linear models that compare average phenotypic differences between genotype groups. While effective for detecting common variants of moderate effect in large sample sizes, such approaches inherently reduce high resolution phenotypic data to summary statistics (group averages), potentially limiting the detection of subtle genotype phenotype relationships. Genomic Informational Field Theory (GIFT) is a recent...
Show abstract
Rare Mendelian disorders affect 300-400 million people globally. Although genetic testing has become widely adopted, gene-specific evidence for tailored variant interpretation remains scattered across resources. We present Gene Portals, a framework for gene-centered multimodal knowledge bases that co-localize expert-harmonized clinical data, functional assays, population variation, structural annotations and gene-specific ACMG/AMP specifications within a single resource. A modular interface inte...
Show abstract
BackgroundA coronary artery calcium (CAC) score of 0 is widely considered to indicate low short- to intermediate-term risk for coronary artery disease (CAD) and is frequently used to defer lipid-lowering therapy. However, a subset of individuals with CAC=0 still experience events, highlighting residual risk not captured by imaging alone. Polygenic risk scores (PRS) quantify lifelong inherited susceptibility, but conventional approaches rely on predefined ancestry labels despite human genetic div...
Show abstract
Type 2 diabetes (T2D) affects 11.1% of the global population, underscoring the need for biomarkers that inform treatment response and glycemic outcomes. We evaluated the association between the FTO variant rs9939609-A and glycemic control in a Mexican population. A total of 174 individuals living with T2D from Merida and Sisal, Yucatan, were included, of whom 85% were receiving oral hypoglycemic agents as main treatment. Glycemic control was defined cross-sectionally as good ([≤]130 mg/dL, n=...
Show abstract
We report a previously undescribed genotypic configuration identified in twins with HNRNPU-related neurodevelopmental disorder. Both twins have two closely spaced mosaic variants on the same allele that never co-occur on any single DNA molecule, resulting in three distinct cell lineages within each individual. We define this genotypic configuration as clustered monoallelic mosaicism (cMoMa). Recognizing the extreme improbability of such a configuration, we systematically explore two potential me...
Show abstract
MotivationFanconi anemia (FA) is a rare disease mainly caused by biallelic pathogenic variants, including structural variants such as large deletions and insertions in FA genes. Currently, variant detection is based on short-read sequencing and probe-based approaches. However, determining the exact genomic breakpoint or achieving allelic discrimination remains challenging. Nanopore-based long-read sequencing enables a comprehensive detection of FA variants, but a unified bioinformatic analysis p...
Show abstract
Study question: How is glycodelin, a glycoprotein secreted by reproductive tissues, causally related to reproductive diseases and traits? Summary answer: We present evidence for a causal role of sex hormones in determining glycodelin levels, but limited evidence that glycodelin subsequently causally impacts reproductive traits. What is known already: Glycodelin is expressed in female and male reproductive tissues and has four glycoforms (-A, -C, -F and -S), with the glycosylation pattern determi...
Show abstract
BackgroundThe relationship between hip osteoarthritis (hip OA) and Alzheimers disease (AD) presents a critical paradox within the emerging "bone-brain axis": widespread phenotypic comorbidity sharply contradicts evolutionary theories of biological antagonism. This study integrates longitudinal and multi-omic analyses to determine whether this clinical overlap masks an underlying genetic neuroprotection. MethodsWe analyzed longitudinal phenotypic data from 261,767 UK Biobank participants using C...
Show abstract
Accurate classification of BRCA1 and BRCA2 variants is essential for cancer risk assessment and therapy selection, yet over one-third remain variants of uncertain significance (VUS). Here, using 120,660 real-world cancer genomic profiles with BRCA1 or BRCA2 variants from a >800,000-sample cohort, we develop machine learning models that predict pathogenicity using clinical and tumor-derived features, including a pan-cancer homologous recombination deficiency signature, co-mutated genes, zygosity,...
Show abstract
Tumour typing from whole-genome sequencing is increasingly accurate, yet molecular subtyping from somatic variants remains challenging because of tumour heterogeneity and inconsistent clinical annotations. Here, we present Mutation-Attention Dual-Task (MuAt2), a Transformer model that jointly classifies histological tumour types and subtypes directly from somatic single-nucleotide variants, indels and structural variants. MuAt2 leverages encoders pre-trained on 2,587 pan-cancer whole genomes, an...
Show abstract
Clear cell renal cell carcinoma (ccRCC) is the leading cause of kidney cancer-related death, but how the tumor microenvironment shapes patient survival is not completely understood. Here, we describe the characterization of ccRCC tumor ecosystems from 498 patients using imaging mass cytometry with a focus on tumor, myeloid, and T cell landscapes. Data from more than 3 million single cells is analyzed using machine-learning to identify key ecosystem features that outperform basic clinical data fo...
Show abstract
Biological ageing begins before birth, with early-life exposures shaping late-life health. These exposures drive health inequities early, yet specific exposures and the composition of the ageing exposome remain largely undefined. This gap may persist as the field lacks agnostic investigations accounting for non-linearity, interactions and subtle signals. We aimed to identify exposures predictive of epigenetic ageing accumulated during childhood and adolescence and explore the composition of the...
Show abstract
Acute kidney injury (AKI) and chronic kidney disease (CKD) are two interconnected clinical conditions, both defined by degree of functional impairment, but with heterogeneous clinical trajectories. Using new transcriptomic technologies, recent studies have described the cellular diversity in the healthy and injured kidney at the single cell level. Here, we used single nucleus transcriptomics to investigate the molecular diversity and commonalities in kidney biopsies from over 150 participants wi...
Show abstract
STUDY QUESTIONAre pathogenic variants in Homeodomain-interacting protein kinase (HIPK4) associated with sperm head abnormalities causing male infertility? SUMMARY ANSWERHIPK4 is a novel candidate gene associated with sperm head defects and human male infertility. WHAT IS KNOWN ALREADYNumerous genes causing male infertility due to Multiple Morphological Abnormalities of the sperm flagella (MMAF) have been described but the genetic basis of sperm head defects is less well understood. STUDY DESI...
Show abstract
Ambient air pollution has been associated with increased incidence of chronic disease and is estimated to contribute towards 4.2 million early deaths annually. Whilst the health impacts are well described, less is understood about the underlying biological mechanisms, particularly when considering the co-occurrence of multiple pollutants. Using an atmospheric chemistry transportation model (EMEP4UK), we generate pre-baseline sampling pollution exposure estimates for eight pollutants in Generatio...
Show abstract
Background and ObjectivesAssociations of common infections with Alzheimer disease (AD) risk have been reported. A hypothesized mechanism to explain these is cerebral amyloid-beta (A{beta}) aggregation as a defence in response to infection, with subsequent tau accumulation. However, few studies have assessed associations of infections with tau and A{beta} pathology. We investigated associations of serological measures of several common infections with plasma p-tau217 and A{beta} status measured b...
Show abstract
BackgroundNo randomized clinical trial comparing the most established new modalities of treatment for patients with localized prostate cancer has been published, and there is scarce comparative effectiveness research assessing Patient-Reported Outcome Measures (PROMs). Objectiveto compare the impact of active surveillance, robot-assisted radical prostatectomy (RARP), Intensity-modulated radiotherapy (IMRT), and real-time brachytherapy on patients, through PROMs, from pre-treatment to five years...
Show abstract
BackgroundKlebsiella pneumoniae is a common cause of neonatal sepsis in Africa, and is frequently hospital acquired. We recently reported an outbreak of multidrug-resistant K. pneumoniae sepsis amongst neonates at a rural hospital in The Gambia, West Africa, involving 57 cases and case fatality of 60%. Here we undertook a retrospective pathogen genomic epidemiology study of clinical and environmental K. pneumoniae isolated during the outbreak, to identify the outbreak strain, refine the epidemic...